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The derivation of entropy for cluster methods is reformulated by constructing the probability of a given 
particle (spin) configuration as a self-consistent product of cluster configuration probabilities. This approach 
gives an insight into the nature of underlying approximations involved at different levels of the cluster- variation 
method. The graphical representation of the product allows us to extend this method for non-Bravais lattices 
as it is demonstrated on interstitial sites of body-centered- and face-centered-cubic crystals. 



in 
o 

Q 



> 

O 

o 
in 

ON 



X3 
O 

o 



X 



I. INTRODUCTION 

The cluster-variation method (CVM) is considered as 
a generalization of the traditional mean-field (Bragg- 
Wiliams [Q) theory taking only one-point configurations 
into account. The first-neighbor pair correlations were 
handled in the method developed by Bethe for square 
lattice . The accuracy of this technique were improved 
by Kramers and Wannier, who chose a square cluster on 
which the probabilities of each configuration were taken 
into consideration jsj. The variational method was gen- 
eralized by Kikuchi for different lattice structure using a 
large class of clusters 0. The above authors used com- 
binatorial methods to determine the entropy as a func- 
tion of cluster configuration probabilities. Since then the 
CVM has been reformulated by several authors. These 
formulations were motivated by the cumbersome calcula- 
tions when determining ensemble configurations for large 
clusters in three-dimensional lattices. Barker showed the 
equivalence between a generalization of quasi-chemical 
approximation and CVM Hijmans and De Boer de- 
veloped a systematic scheme for obtaining the free en- 
ergy expressed by cluster variables Q. Woodbury found 
that many of the previous results were reproducible in a 
direct manner using the composition law of information 
theory |^ . Introducing a generalized cumulant expansion 
of the entropy the CVM was reformulated by Morita Q . 
This latter formulation is strongly related to the Mobius 
inversion [p|-pl|. A systematic counting and classifica- 
tion of clusters were carried out by Gratias et al. | |T^ ] 
for general crystal structures including some non-Bravais 
lattices. Sanchez et al. developed a general formalism 
based on the description of configurational cluster func- 
tions in terms of an orthogonal basis in the multidimen- 
sional space of site variables jl^]. Several authors inves- 
tigated the convergence of CVM when using larger and 
larger clusters . 

Recently the CVM is widely used to determine phase 
diagrams in different systems (for reviews see the works 
by de Fontaine and further references therein). 

In the present work we suggest a simple way for the en- 
tropy derivation which is not related to the Gibbs for- 
malism and may be used for dynamical (non-equilibrium) 
systems too. Furthermore this method is applicable to 
non-Bravais lattices which are very important for practice 



(e.g. superionic conductors, metal-hydrogen systems, in- 
tercalation alloys, etc). For this purpose the CVM is 
reformulated by showing how to construct the probabil- 
ity of a given particle (spin) configuration as a product 
of cluster configuration probabilities taking the transla- 
tion symmetries and self-consistency into account. These 
conditions can be satisfied by using different approxi- 
mations equivalent to those mentioned above. Now we 
concentrate on the entropy per elementary cells which is 
related to the conditional entropy introduced in informa- 
tion theory [Q. As a result the entropy as well as the 
energy are expressed as a function of the probabilities of 
cluster configurations. Thus we have a free energy to be 
minimized with respect to these variables for obtaining 
thermodynamic equation of states. 

In the subsequent section the essence of this method 
is illustrated on the one-dimensional lattice. This idea is 



adapted for the two-dimensional system in Sec. III. Here, 



a graphical representation of the construction of configu- 
ration probabilities will be introduced. This technique is 
used to obtain entropy expressions for two non-Bravais 
lattices formed by the interstitial sites in body-centered- 
and face-centered-cubic lattices in Sees. IV and 0. Fi- 
nally the results are summarized in Sec. VI. 



II. ONE-DIMENSIONAL LATTICE 

In the one-dimensional model a site variable rji {i = 
1, N) denotes the state of the ith lattice point. In the 
lattice-gas formalism (rji = or 1), however, the subse- 
quent formulae remain valid for those systems character- 
ized by Q states at each site (e.g. 77^ = 0, 1, Q — 1). The 
function fpf — f{r]i,...,r]i\f) describes the probability of 
a configuration specified by the rji variables. Similarly, 
we introduce a cluster of k subsequent sites on which the 
probability of a configuration is defined by pfe(r/i, ...,77^). 
If the system is translation invariant, then 

Jn =Pk{Vj,Vj+u---,Vj+k-i) (1) 

i<j, i>J + fe-l 

for arbitrary 1 < j < N — k. Consequently, the set of 
Pkivi, ■ ■ ■ iVk) functions satisfies the following relations: 
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Pk{vi,---,Vk) ^^Pk+i{vo,Vi,- ■ - ^Vk) , (2) 

no 

Pkivi, ■■■,m)= X! Pk+iim, ■ ■ ■,Vk,vk+i) , (3) 

Vk + l 

for fc > 1 and 

The probabilities of the fc-point configurations are com- 
pletely described by introducing 2*^"^ parameters 
More precisely, the number of independent variables may 
be less than 2''^^ when additional symmetries (e.g. re- 
flection, particle-hole) are taken into consideration. 

In n-point approximation the probability of a particle 
configuration is expressed as: 

N 

/n =Pn{m,---,Vn) Yl Pl{Vi\Vi-n+l,---,Vt-l) (5) 



i—n+1 



where 



75/1 \ Pn{Vi~n+l , ■ ■ ■ , Vi) 
Pl{Vi\Vi-n+l, ■ ■ ■ ,111) = 7 T (6) 



Pn-liVt- 



n+1, ■ 



describes the conditional probability of finding rji state at 
the ith site {n < i < N) if the cluster configuration in the 
previous n — 1 points is given. The above expression is 
self-consistent in the sense that it satisfies the condition 
of translation invariance [see Eq. (0)] for fc < n. To check 
it the summation should be performed with respect to 
the first and/or last site variables step by step using the 
expressions (||) and (||). 

Choosing the Boltzmann constant to be unit the en- 
tropy is 



{'?.} 



N 



(7) 



where the summation runs over all the possible configu- 
rations. Substituting Eq. (|^) into (|^) one obtains that 

- ^ /jvlnp„(77i, . . . ,?7„) 

N 

\vi-n+i,- ■ ■,m-i) ■ (8) 

{Vi} i=n+l 

According to Eq. (|l|) this expression may be simplified 
and in the thermodynamic limit the entropy per lattice 
sites is 



s = lim — 

Af^oo TV 



(9) 



= - E Pn(?7l, ■ • ■ ,??n)ln 



PriiVl, ■■■,Vn) 
Pn~l{rn, ■ ■ ■,Vn-l) 



As shown by Woodbury [Q this quantity is equivalent to 
the conditional entropy introduced in information theory 
pOf . According to Eqs. (|^) and (||) the above entropy 
may be written in the following form: 



(10) 



where 



— 



Pn(?7l, ■•■,?7«) In Pn(77l,-- ■,'?«) • (11) 



These expressions are very convenient for the CVM. 

Using the techniques mentioned in the Introduction 
the entropy of the one-dimensional system was previously 
derived by several authors Here it is worth men- 

tioning that the pair {n = 2) approximation reproduces 
the exact solution for the one-dimensional Ising model 
with nearest-neighbor interaction. 

The present derivation is based on the fact that the 
configuration probability given as product of conditional 
probabilities is self-consistent, i.e. it satisfies the Eq. (|^). 
The generalization of this approach for higher dimensions 
is not trivial, approximations are required as it will be 
shown in the following section. 



III. SQUARE LATTICE 

In order to explore the difficulties arising on a square 
lattice we consider first the probability of a configura- 
tion constructed on the analogy of the one-dimensional 
system using square cluster configuration probabilities. 
In a translation invariant system we introduce the ixmc- 
tion p4(?7a, '7b, ?7c, ?7d) (clockwise labelling) characteristic 
to the probability of any four-point configurations on 
sites a,b,c and d forming a square with nearest neighbor 
bond sides. From these quantities the three-, two- and 
one-point configuration probabilities (on the correspond- 
ing subclusters) can be derived on the analogy of Eqs. 
(^ and (H). Henceforth we restrict ourselves to systems 
having fourfold symmetry. In this case the three- and 
two point configuration probabilities are independent of 
the cluster orientation. 

The system with L x M sites can be covered by 
(L — 1) X [M — 1) overlapping squares. Following this 
covering procedure the /at function is built up from 
PAiVaiVbiVciVd) functions. To visualize this calculation 
a graphical representation of this product is introduced 
as displayed in the subsequent figures. Here the squares, 
triangles, solid lines and closed circles represent p4, ps, 
P2 and pi functions in the numerator with site variable 
arguments corresponding to the positions. If these quan- 
tities appear in the denominator then the above symbols 
will be plotted by dashed lines or open circles. The size 
of the closed (or opened) circles refer to the exponents 
of pi functions which may differ from 1 (or -1) in the 
examples investigated. 
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FIG. 1. Graphical representation of an intermediate stage 
when constructing configuration probability as a product in 
four-point approximation. Squares and closed circles repre- 
sent four-point and one-point cluster configuration probabili- 
ties in the numerator with arguments corresponding to their 
position. Dashed lines denote two-point configuration proba- 
bilities in the denominator. 

According to a direct (naive) way the addition of the 
site variable rja to the product as shown in Fig. ^ is per- 
formed via the following conditional probability: 



Pii'na\Vb,Vc,Vd) 



PiiVa,Vb,Vc,Vd) 
P3{Vb,Vc,Vd) 



(12) 



Unfortunately, this construction results in a configura- 
tion probability /n which is not self-consistent. This 
difficulty may be circumvented if all the functions ap- 
pearing in the expressions are approximated as 

aj flbT Vct Vd) 



P2iVb,Vc)P2iVc,Vd) 
PliVc) 



(13) 



This situation is illustrated graphically in Fig. |^. Within 
the framework of this approximation the self-consistency 
of the resultant /jv is easily checked because the sum- 
mation defined in Eq. (|l^) can be executed graphically 
too. As a result the square transforms into an "angle" 
built up from two p2 and a pi functions. For example, 
the summation over ry^ in Fig. |l] removes the abed square 
as well as the touched dashed lines and bullet. This pro- 
cedure may be repeated for all the rji variables belonging 
to "free" site of a square until only the desired square 
remains on the screen. As a consequence the entropy 
per sites may be evaluated on the analogy of the one- 
dimcnsional calculation. Neglecting boundary effects in 
the thermodynamic limit it obeys the following form: 



(14) 



where the Sn quantities are defined on the analogy of 
Eq. ( prj ) and the symmetries mentioned above are taken 
into consideration. This formula is equivalent to those 
derived by Kramers and Wannier H. 



The three-point ( "angle" ) approximation suggested by 
Kikuchi |Q] can also be reproduced by the present ap- 
proach. In this case the lattice is covered with triangles 
as shown in Fig. |^. 




FIG. 2. In three-point approximation the configuration 
probabilities are built up as a product of ps functions (tri- 
angles) divided by pi terms where the value of i/ = 1, 2 is 
denoted by the size of open circles. 

The condition of self-consistency is fulfilled if the two- 
point configuration probabilities are approximated in the 
mathematical manipulations as 



P2{Vb,Vc) =^P3{Va,Vb,Vc) ^Pl{Vb)Pl{Vc 
P2{Va,Vc) ^^P3{Va,Vb,Vc) = Pl{Va)Pl{Vc 



(15) 
(16) 



where we used the notation indicated in Fig. ^ and similar 
formula is assumed for P2{Vaiilb)- On the basis of Fig. |^ 
we can construct /n and the entropy per lattice sites is 



s = 53 - 2Si 



(17) 



Notice that in this approximation /jv does not refiects 
the fourfold symmetry assumed in the system in compar- 
ison with those represented graphically in Fig. |l|. This 
approximation takes into account all the pair correlation 
viapa, however, only a quarter of the possible three-point 
correlations is handled. 

The reproduction of pair approximation is worth men- 
tioning for later convenience. One can image that in the 
above three-point approximation the ps function is con- 
structed from p2 functions as 



P3iVa,Vb,Vc) 



P2iVa,Vb)P2{Vc,Va) 
PliVa) 



(18) 



To satisfy the condition of self-consistency one has to use 
the following approximation: 



^ P2iVa,1lb)P2{Vc,1la) 



PliVa) 



= PlMPliVc) 



(19) 
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The graphical representation of the product construction 
is illustrated in Fig. ^. The entropy obtained agrees with 
the result derived by Bethe B. 



where a = A, B and fourfold symmetry is assumed. 



IV. TETRAHEDRAL SITES IN BCC LATTICE 




In the BCC lattice the interstitial atoms are positioned 
at the tetrahedral sites exhibiting a non-Bravais lattice. 
These sites can be divided into six (equivalent) sublat- 
tices labelled by a = 1, . . . , 6 (see Fig. 0). 
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FIG. 3. Graphical representation of configuration probabil- 
ity in pair approximation. Solid lines represent two-point con- 
figuration probabilities in the numerator. The sizes of open 
circles refer to the exponents (1, 2 or 3) of pi functions in the 
denominator. 

We can choose larger clusters to increase the accuracy 
of CVM. The above covering technique is required to 
have a translation invariant constructed as a product 
of conditional probabilities. The main problem is how 
to construct the denominator of the conditional proba- 
bility characteristic to the probability of configurations 
on the overlapping region. In general the following rule 
of thumb seems to be useful for finding the approxima- 
tions required to satisfy the condition of self-consistency. 
The configuration probability on the overlapping region 
should be constructed from the largest subclusters of the 
present clusters. This idea works well for k x fc-point 
clusters. The graphical representation becomes confused 
if one chooses large non-compact clusters. At the same 
time, the numerical solution is also very complicate for 
large clusters. 

The generalization of the present approach to other 
(two- or three-dimensional) Bravais lattices is straightfor- 
ward. As the simplest example for a non-Bravais lattice 
one can study the chess-board like sublattice (antiferro- 
magnetic) ordering on the square lattice. In this situa- 
tion the lattice is divided into two (A and B) interpene- 
trating sublattices with different average occupations (or 
magnetizations). This system can be described by intro- 
ducing two types of four-point configuration probabili- 
ties, namely (?7a, r?b, Vc Vd) and pf{r]a, tlb, Vc m) where 
the upper indices denotes the sublattice to which site a 
belongs. The entropy per sites is derived by repeating 
the above calculation with using alternately p"^ and 
"squares" during the covering. The result is similar to 
those given by Eq. (|lj), namely 



(20) 



:7V 



FIG. 4. Sublattice structure of tetrahedral interstitial sites 
(small spheres) in BCC lattice (large spheres). 

In the six-sublattice mean-field approximation the con- 
figuration probability is defined as a product of the cor- 
responding one-point configuration probabilities Pi{r]i). 
This approximation is evidently self-consistent and re- 
sults in the following entropy (henceforth normalized by 
the number of host lattice points) : 



(21) 



This approximation is used previously to demonstrate the 
possibility of two subsequent phase transitions during the 
ordering process in Agl This method considers all 

the phases conserving translation symmetry of the host 
lattice. The cubic symmetry of these states is broken 
when the sublattice occupations are different. 

In order to have a more accurate description of these 
states we have to distinguish three different four-point 
clusters positioned on the faces of a cubic cell. The prob- 
abilities of the configurations in these clusters are de- 
noted by (?7a, Tlb,Tlc,Tld) whcrc (3 = x,y,z refers to the 
face on which the square is positioned. When the parti- 
cle configuration is built up cube by cube as a product 
of conditional probabilities then we add 12 new site vari- 
ables to the system. We choose these variables to be 
located on the three connecting faces (see Fig. ^). The 
connections toward the touched cubes are taken into ac- 
count by the squares standing out the cubic cell plotted 
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in Fig. H. In fact, these 18-point clusters are used for 
covering. Now, however, the probabihty of the configu- 



rations on these clusters are constructed from and p" 
functions as represented graphically in the figure. 




FIG. 5. Graphical representation of the conditional prob- 
ability when adding a cubic cell to the system. Squares and 
open circles denote P4 and pf functions in the numerator and 
denominator. 

The open circles at the corners of the outstanding 
squares denote that the configuration probability on the 
overlapping region is approximated by a product of p" 
functions in accordance with the rule of thumb men- 
tioned above. The Jn function obtained is self-consistent 
if a product Pipf is substituted for the two-point con- 
figuration probabilities appearing during the summation 
process, for example: 



(22) 



where a and a' refers to the corresponding sublattices. 
After some algebraic manipulation the entropy is given 
as: 



(23) 
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One can choose different ways of covering for obtain- 
ing the same result. For example, first the (x,?/,0) sites 
are covered by distinct horizontally oriented squares {pX)^ 
then we add all the vertical squares {p% and p\) to the 
system having two sites in the (x,?/, 1/4) plane, and so 
on. 

Instead of suggesting other approximations for this lat- 
tice structure, in the subsequent section we demonstrate 
how the present method can be used for such a lattice 
exhibiting two types of interstitial sites. 



V. INTERSTITIAL SITES IN FCC LATTICE 

In the FCC lattice there are one octahedral and two 
tetrahedral interstitial sites per lattice points as illus- 
trated in Fig. ^ These sites can be divided into three 
(FCC) sublattices. The three-sublattice formalism al- 
lows us to study a rich variety of states conserving the 
translation invariance of the host lattice. 




FIG. 6. The three-sublattice structure of tetrahedral (small 
spheres with two gray scales) and octahedral (middle spheres) 
interstitial sites in FCC lattice (large dark spheres). 

Notice that in this non-Bravais lattice the octahedral 
sites are positioned at every second centers of the cubes 
appearing in NaCl-type structure of the tetrahedral sites. 
This feature inspires the choice of simple cubic and body- 
centered-cubic clusters (see Fig. ^ for the construction 
of /at. 




FIG. 7. Cubic (nine-point) cluster on interstitial sites in 
FCC lattice. The figures on the spheres represent site indices. 

The condition of self-consistency is fulfilled if we use 
the following approximations during summations: 
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X] P^i'^o, . . . ,??8) = ^Psivo, ■■■,m)= (24) 

Pi iVO ,m,V4,V5 )P4 (?70 : ^71 7 ??2 , ??3 )P4 {Vo,V2, V4 , V6 )Pl{Vo) 
P2 iVO , m )P2 {V0,V2 )P2 {V0,V4) 

and similar expressions are assumed when instead of rjj 
the sum runs over rji where i < 6. Here the site in- 
dices of the arguments of ps agree with those shown in 
Fig. excepting 8. The same formulae will appear in the 
denominators of the corresponding conditional probabil- 
ities. Consequently, the entropy is 

s = ^9 + 58 - 6^4 + 65f "^'^ - '5^'^ - 5f (25) 

where the upper indices in the 5„ functions indicate sub- 
lattices to which the cluster sites belong if it is necessery. 
In this expression the cubic symmetry is taken into con- 
sideration. 

Despite the symmetries of the above cubic clusters the 
number of parameters will be large leading to difficulties 
in the numerical calculations. The number of parameters 
is drastically reduced if the cubic clusters (as well as the 
whole system) are built up from smaller ones. In the sim- 
plest case we can introduce a triangle cluster formed by 
"nearest-neighbor" sites belonging to different sublattices 
(e.g. sites 0, 1 and 8 in Fig. 0). Suppose that ^3(770,776, 77c) 
denotes the probabilities of configurations on these clus- 
ters independently of its orientation, where rja, rji, and 
77c belong to the octahedral and tetrahedral sublattices, 
respectively. This choice is very convenient because we 
have only seven parameters for the variation technique. 
The configuration probabilities on the nine-point cluster 
shown in Fig. ^ is expressed by these quantities in such a 
way that the edges will be represented with equal weight, 
that is 

f N P3{Va,V0,Vl)P3{V8,V3,Vl)P3{V8,V5,Vl) 

P9(Vq, ■■■,Vs)^ 27 ^ 

Pim.m) 

^ P3{VS,V3, m )P3 iVS ,V3,V2 )P3 {Vs ,V6,V4) 

pl{v8,vi)pi{m) 

where p2 and p2 are distinguished because their sec- 
ond arguments belong to different tetrahedral sublat- 
tices. The whole system may be covered by these body- 
centered-cubic clusters overlapping each other at single 
tetrahedral sites. In this approximation the entropy is 
given as 

-Sf^~3Sr^-3Sr^ (26) 

This triangle approximation, however, is not definite. 
Another result may be obtained on the basis of Bethe's 
method as follows. The elementary cell of this non- 
Bravais lattice has three sites chosen to form a triangle 
cluster defined above (see triangle abc in Fig. ||). 




FIG. 8. Positions of ps functions connecting the triangle 
abc to its nearest neighbors having the same orientation in 
case of triangle-triangle pair approximation. 

Now this triangle is considered as a single site variable 
with 2^ possible values and the triangle-triangle pair cor- 
relations are handled on the analogy of pair approxima- 
tion. For example, the configuration probability on the 
triangle pair abc-def is defined as & P3P3P3 / {P2P1) prod- 
uct. Due to the FCC symmetries each triangle has twelve 
nearest neighbors. In this case, only those triangle- 
triangle pairs are considered which are connected to each 
other through a single triangle cluster as illustrated in 
Fig. 1^. As a result the entropy obeys the following form: 

s = 6 53 - 25f -^^^ - 25f - 5f 

-5f)-25r)-25r) (27) 

which reproduces Eq. ( |2^ ) if pi (77^)^1 (77c) is substituted 
for P2{Vbi'ilc)- Evidently, the approximations required by 
the self-consistency are different in these situations. The 
latter formula is used to check the mean- field phase dia- 
grams of a lattice-gas model suggested to describe the or- 
dering processes in alkali- fuUerides The systematic 
comparison of the possible approximations goes beyond 
the scope of the present work. 

VI. CONCLUSIONS 

The derivation of entropy for cluster techniques is re- 
formulated by showing how to construct the configura- 
tion probabilities as a product of conditional probabili- 
ties when building up the system from overlapping clus- 
ters. In one-dimensional system this product is obvi- 
ously self-consistent. For higher dimensional systems, 
however, some approximations are required to fulfill the 
self-consistency which is relevant when expressing the en- 
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tropy per sites as a conditional entropy introduced in in- 
formation theory. 

A graphical representation of the product is suggested 
to make the calculations treatable. Using this technique 
we have reproduced some traditional results derived on 
square lattices. Due to its simplicity, this approach is eas- 
ily applicable to complicated lattices. Using this method 
entropy expressions are derived on two non-Bravais lat- 
tices, formed by the interstitial sites of BCC and FCC 
structures, which may be useful for the investigation of 
many real systems. 

In fact, the present derivation of entropy is equiva- 
lent to combinatorial calculations. However, it provides 
a better understanding of the approximations applied. It 
is emphasized that these methods assume merely trans- 
lation invariance despite some other techniques based on 
the formalism of equilibrium statistical physics. As a con- 
sequence the entropy expressions remain valid for those 
non-equilibrium systems whose stationary states satisfy 
this condition. 
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